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COMPUTER-IMPLEMENTED METHOD, SYSTEM AlVD PROGRAM 
PRODUCT FOR ANALYZmG MESSAGES ASSOCIATED WITH COMPUTER 

PROGRAM CODE 

Cross-Reference to Related Applications 

[0001] This application is related to co-pending application entitled "Computer- 
implemented Method, System and Program Product for Determining a Message Quantity 
for Computer Prograni Code," filed October 9, 2003 and assigned attorney docket number 
RSW920030212US1, and to co-pending application entitled "Computer-implemented 
Method, System and Program Product for Reviewing a Message Associated with 
Computer Program Code," filed October 9, 2003 and assigned attorney docket number 
RSW920030213US1, both of which are herein incorporated by reference. 

Background of the Invention 

1. Field of the Invention 

[0002] The present invention generally relates to a computer-implemented method, 
system and program product for analyzing messages associated with computer program 
code. Specifically, the present invention determines information/metrics corresponding 
to the messages so that translation estimates can be computed. 
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2. Related Art 

[0003] As computers continue to become more integrated in everyday life, computer 
software continues to become more sophisticated. For example, today a computer user 
can prepare a tax return, pay bills, purchase goods or services, etc., from the comfort of 
his/her personal computer. Such convenience is provided by various software packages. 
To this extent, computer program developers are constantly working to improve existing 
products, or introduce new products to the market. 

[0004] In developing computer program code, a developer will usually include certain 
"messages" in properties files that are designed to support the product for the end-user. 
Typical messages that can be provided include error messages, warning messages and 
information messages. Although each of these messages serves a different purpose, they 
are usually necessary to fiiUy support the program code being developed. Unfortunately, 
there is currently no way for developers to readily determine information about the 
messages. Specifically, since messages are constantly being created, edited or deleted, 
there is very little way to know specific details about the messages. For example, a 
developer might wish to know the quantity of new messages that were created during a 
fixed time period, and the quantity of lines of new content that were included. Such 
details are especially difficult to ascertain given the large volume of program code 
components and properties files that can be involved with a single product. 
[0005] As a result of this predicament, program integrated information (PII) counts are 
usually inaccurate and highly inflated to try to accommodate some imknown number. 
This raises various concerns since translation between languages is often necessary for a 
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program. Specifically, it is common for a software product to be made available in 
multiple countries/regions of the world. As such, lingual translation of interfaces, 
messages and the like is necessary. The longer the messages that are utilized to support a 
program, the greater the translation costs. Thus, with no knowledge of the amount of 
content to be translated as messages are created, edited or deleted, translations costs are 
virtually impossible to accurately predict. 

[0006] In view of the foregoing, there exists a need for a computer-implemented method, 
system and program product for analyzing messages associated with computer program 
code. Specifically, a need exists for a system that can be configured to analyze certain 
types of messages. A fiirther need exists for the system to be configured to return certain 
pieces of desired information about the messages. A further need exists for the system to 
be able to determine an estimated line count within the analyzed messages. 

Summary of the Invention 

[0007] In general, the present invention provides a computer-implemented method, 
system and program product for analyzing messages. Specifically, under the present 
invention, an analysis criteria is provided. The criteria identifies computer program 
components, properties files and a predetermined time period so that specific messages 
can be identified. Thereafter, information desired to be obtained about the messages is 
designated. Typical types of desired information include a quantity and an estimated line 
count for certain types of messages (e.g., newly created, edited and/or deleted). Once this 
is designated, instances of the types of messages (e.g., specific messages fitting the 
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designated types) will be identified using the analysis criteria, and then analyzed to 
determine the desired information. The estimated line count can be used to estimate 
translation costs for the messages. 

[0008] A first aspect to the present invention provides a method for analyzing messages 
associated with computer program code, comprising: providing message analysis criteria, 
wherein the message analysis criteria identifies at least one computer program 
component, at least one properties file and a predetermined time period; designating 
desired information, wherein the desired information includes a message quantity and an 
estimated line count for at least one type of message; and identifying instances of the at 
least one type of message based on the analysis criteria, and analyzing the instances to 
determine the desired information. 

[0009] A second aspect to the present invention provides a system for analyzing 
messages associated with computer program code, comprising: a criteria system for 
providing message analysis criteria, wherein the message analysis criteria identifies at 
least one computer program component, at least one properties file and a predetermined 
time period; an information request system for designating desired information, wherein 
the desired information includes a message quantity and an estimated line count for at 
least one type of message; and an analysis system for identifying instances of the at least 
one type of message based on the analysis criteria, and for analyzing the instances to 
determine the desired information. 

[0010] A third aspect to the present invention provides a program product stored on a 
recordable medium for analyzing messages associated with computer program code, 
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which when executed, comprises: program code for providing message analysis criteria, 
wherein the message analysis criteria identifies at least one computer program 
component, at least one properties file and a predetermined time period; program code for 
designating desired information, wherein the desired information includes a message 
quantity and an estimated line count for at least one type of message; and program code 
for identifying instances of the at least one type of message based on the analysis criteria, 
and for analyzing the instances to determine the desired information. 
[00 11] Therefore, the present invention provides a computer-implemented method, 
system and program product for analyzing messages. 

Brief Description of the Drawin£S 
[0012] These and other features of this invention will be more readily understood from 
the following detailed description of the various aspects of the invention taken in 
conjunction with the accompanying drawings in which: 

[0013] Fig. 1 depicts a system for analyzing messages associated with computer program 

code according to the present invention. 

[0014] Fig. 2 depicts the message reporting system of Fig. 1. 

[0015] Fig. 3 depicts an illustrative criteria interface, according to the present invention. 
[0016] Fig. 4 depicts an illustrative information request interface according to the present 
invention. 

[0017] Fig. 5 depicts a flow diagram of a method for analyzing messages associated with 
computer program code according to the present invention. 
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[0018] The drawings are merely schematic representations, not intended to portray 
specific parameters of the invention. The drawings are intended to depict only typical 
embodiments of the invention, and therefore should not be considered as limiting the 
scope of the invention. In the drawings, like numbering represents like elements. 

Detailed Description of the Invention 
[0019] As indicated above, the present invention provides a computer-implemented 
method, system and program product for analyzing messages. Specifically, under the 
present invention, an analysis criteria is provided. The criteria identifies computer 
program components, properties files and a predetermined time period so that specific 
messages can be identified. Thereafter, information desired to be obtained about the 
messages is designated. Typical types of desired information include a quantity and an 
estimated line count for certain types of messages (e.g., newly created, edited and/or 
deleted). Once this is designated, instances of the types of messages (e.g., specific 
messages fitting the designated types) will be identified using the analysis criteria, and 
then analyzed to determine the desired information. The estimated line count can be used 
to estimate translation costs for the messages. 

[0020] Referring now to Fig. 1, system 10 for analyzing messages associated with 
computer program code is shown. As depicted, system 10 includes a computer system 12 
that contains computer program code 14 (hereinafter simply referred to as "program code 
14") and messages 16 as developed by one or more developers 15. As will be fiirther 
described below, message reporting system 18 is an executable program or the like that 
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analyzes messages 16 based upon analysis criteria to determine desired information. 
Typically, the desired information includes a quantity and an estimated line count for at 
least one type of messages. This information is useful in computing an estimated 
translation cost for newly created or edited messages. 

[0021] It should be understood that computer system 12 is intended to represent any type 
of computerized device capable of executing programs and performing the functions 
described herein. For example, computer system 12 could be a personal computer, a 
handheld device, a workstation, client, server, etc. In addition, computer system 12 could 
be implemented as a stand-alone system, or as part of a computerized network such as the 
Internet, local area network (LAN), wide area network (WAN), virtual private network 
(VPN), etc. In the case of the former, program code 14 and messages 16 could be 
provided to computer system 12 on a recordable medium such as a diskette or CD-ROM 
(e.g., by a developer 15). In the case of the latter, program code 14 and messages 16 
could be made available to computer system 12 over a network (e.g., via e-mail, file 
sharing, etc.). Furthermore, if implemented within a network, computer system 12 could 
represent a client or a server. As known, communication between clients and a server 
could occur via a direct hardwired connection (e.g., serial port), or via an addressable 
connection that may utilize any combination of wireline and/or wireless transmission 
methods. The server and the clients may utilize conventional network connectivity, such 
as Token Ring, Ethernet, WiFi or other conventional communications standards. 
Moreover, connectivity could be provided by conventional TCP/IP sockets-based 
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protocol. In this instance, the clients could utilize an Internet service provider to 
establish connectivity to the server. 

[0022] Regardless of its implementation, as shown computer system 12 comprises central 
processing unit (CPU) 20, memory 22, bus 24, input/output (I/O) interfaces 26, extemal 
devices/resources 28 and storage unit 30. CPU 20 may comprise a single processing unit, 
or be distributed across one or more processing units in one or more locations, e.g., on a 
client and computer system. Memory 22 may comprise any known type of data storage 
and/or transmission media, including magnetic media, optical media, random access 
memory (RAM), read-only memory (ROM), a data cache, etc. Moreover, similar to CPU 
20, memory 22 may reside at a single physical location, comprising one or more types of 
data storage, or be distributed across a plurality of physical systems in various forms. 
[0023] I/O interfaces 26 may comprise any system for exchanging information to/from an 
extemal source. Extemal devices/resources 28 may comprise any known type of extemal 
device, including speakers, a CRT, LCD screen, handheld device, keyboard, mouse, voice 
recognition system, speech output system, printer, monitor/display, facsimile, pager, etc. 
Bus 24 provides a communication link between each of the components in server 14 and 
likewise may comprise any known type of transmission link, including electrical, optical, 
wireless, etc. 

[0024] Storage unit 30 can be any system (e.g., database) capable of providing storage for 
information under the present invention. Such information could include, for example, 
program code, messages, translation costs, etc.. As such, storage unit 30 could include 
one or more storage devices, such as a magnetic disk drive or an optical disk drive. In 
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another embodiment, storage unit 30 includes data distributed across, for example, a local 
area network (LAN), wide area network (WAN) or a storage area network (SAN) (not 
shown). Furthermore, although not shown, additional components, such as cache 
memory, communication systems, system software, etc., may be incorporated into 
computer system 12. 

[0025] As indicated above, message reporting system 18 will analyze messages 16 that 
are associated with program code 14, and provide desired/requested information. 
Specifically, message reporting system 18 is capable of determining a quantity of 
messages created, edited and/or deleted, as well as an estimated line count for each such 
message type. This information can be displayed to developers 15 or the like in result 
interface 32. 

[0026] Referring now to Fig. 2, message reporting system 18 is shown in greater detail. 
As depicted, message reporting system 18 includes criteria system 40, information 
request system 42, analysis system 44 and output system 46. Criteria system 40 provides 
the mechanism by which message analysis criteria will be provided. Specifically, criteria 
system 40 typically generates a criteria interface through which developers 15 or the like 
can designate the criteria by which messages will be identified and analyzed. Referring to 
Figs. 3, an illustrative criteria interface 50 is shown in greater detail. In a typical 
embodiment, the message analysis criteria that will be provided includes at least one 
computer program component, at least one properties file and a predetermined time 
period, which can be provided using component field 52, property file field 54 and date 
boxes 58, respectively. This collective information identifies a specific "pool" of 
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messages that can be subject to analysis. That is, identifying computer program 
components identifies particular sections of program code 14 whose associated messages 
16 could be subject to analysis under the present invention. Identifying properties files 
identifies the precise files corresponding to the identified computer program components 
in which actual messages are located. For example, a single computer program 
component could have ten associated properties files. However, the developer may only 
be interested in potentially analyzing messages firom two of the ten properties files. The 
predetermined time period provides an exact date range of potential messages. For 
example, the developer might only be interested in messages that exist between certain 
dates. Using the illustrative information shown in Fig. 2, the developer wishes the pool 
of potential messages to include those associated with computer program components 
1E46 and 2F45, and that are stored in properties files FILE.*, FILE2.*, FILE3.* and FILE 
4*, and that exist between May 08, 2003 and May 12, 2003. If the developer is satisfied 
with the criteria that has been provided, he/she can select submit button 60 to finalize the 
same, or cancel button 62 to cancel the process. 

[0027] Referring back to Fig. 2, once the developer has provided the message analysis 
criteria, the developer can then designate desired information via information request 
system 42. Specifically, the developer will express which types of messages he/she 
wishes to be analyzed from the pool of messages, as well as the information he/she 
desires to obtain fi-om the analysis. To this extent, information request system 42 will 
typically generate an information request interface. Referring to Fig. 4, an illustrative 
information request interface 70 is shown. In general, information request interface 70 
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allows the developer to designate the types of messages he/she wishes to have analyzed 
from the pool. Under the present invention, the possible types of messages that can be 
analyzed include newly created messages, edited messages and deleted messages. For 
each type of message, the developer has the capability to designate the information he/she 
desires to receive. In a typical embodiment, the developer will use check boxes 72, 74 
and 76 or the like to designate whether he/she wishes to receive a quantity and an 
estimated line count for each type of message. For example, as shown in Fig. 4, the 
developer has checked both the quantity box and the estimated line count box for the 
"created messages" type. This means that the developer wishes to know how many new 
messages were created, and how many total lines of content were created for messages in 
the pool established by the message analysis criteria (e.g., messages associated with the 
identified computer program components, that are located within the identified properties 
files and that were created between the established date range). Similarly, both boxes 
being checked for the "edited messages" type means that the developer wishes to know 
how many messages were edited, and how many lines of content were edited in messages 
in the pool established by the message analysis criteria. Still yet, both boxes being 
checked for the "deleted messages" type means that means that the developer wishes to 
know how many messages were deleted, and how many lines of content were deleted 
from messages in the pool established by the message analysis criteria. In any event, once 
the developer has designated the information desired to be received, he/she can select 
submit button 78 to fmalize the same, or cancel button 80 to cancel the process. 
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[0028] Referring back to Fig. 2, after the desired information has been designated, 
analysis system 44 will locate/identify the applicable instances of the message types for 
which information was desired. Specifically, analysis system 44 will obtain the instances 
of the message types that are associated with the identified computer program 
components, from the identified properties files, and that are within the established time 
period (date range) for which information is desired. Using the examples set forth above, 
analysis system 44 will obtain the instances that are associated with computer program 
components 1E46 AND 2F45, that are located in properties files FILE.*, FILE2.*, 
FILE3.* and FILE 4*, and that were created, edited or deleted between May 08, 2003 and 
May 12, 2003. 

[0029] Once the appropriate instances/messages are identified, analysis system 44 will 
then analyze the same to determine the desired information. That is, analysis system 44 
will determine the quantity of messages created, edited and deleted, as well as respective 
estimate line counts for content newly added, edited and deleted. In general, the 
quantities will be computed by summing the number of instances of each message type. 
For example, if ten messages were newly created in the pool established by the message 
analysis criteria, the quantity would be ten. The estimated line count is determined by 
computing a quantity of words, and then dividing that quantity by a predetermined 
number (e.g. as stored in storage unit 30). In a typical embodiment, the predetermined 
number is ten because it is presumed that each line of content will have approximately ten 
words, although it should be understood that any predetermined quantity could be used to 
estimate the line count. Thus, for example, if there were ten newly created messages, and 
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a total of 500 words of newly created content, analysis system 44 would determine that 
there is an estimated 50 lines of new content. Similarly, if twelve messages in the 
designated pool were edited, and a total of 600 words of content were edited, analysis 
system 44 would determine that an estimated 60 lines of content had been edited. Still 
yet, if five messages in the designated pool were deleted, and a total of 300 words of 
content were deleted, analysis system 44 would determine that an estimated 30 lines of 
content had been deleted. 

[0030] Using the estimated line count, analysis system 44 could also provide a translation 
estimate for newly created and edited messages. Specifically, using a predetermined 
translation cost, analysis system 44 could compute the translation estimate based on a 
quantity of words or lines that had been created or edited. Thus, for example, if the 
estimated translation cost is $1.00 per word or $10.00 per line (using the ten word per line 
predetermined value discussed above), the cost for translating the newly created content 
in the designated pool would be $500, while the cost for translating the edited content in 
messages within the designated pool would be $600. This would yield a total estimated 
translation cost of $1 100. A translation savings or lost cost for content deleted fi-om 
messages in the designated pool could be computed in a similar fashion. For example, 
since 30 lines of content were deleted from messages in the designated pool, it could be 
presumed that an estimated total of $300 had been lost since the content was no longer 
being used. Once all desired information has been determined by analysis system 44, 
output system 46 can generate result interface 32 to provide the information to the 
developer. 
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[0031] Referring now to Fig. 5, a flow diagram 100 of a method according to the present 
invention is shown. As depicted, first step SI is to providing message analysis criteria. 
As indicated above, the message analysis criteria identifies at least one computer program 
component, at least one properties file and a predetermined time period. Second set S2 is 
to designate desired information such as a message quantity and an estimated line count 
for at least one type of message. Third step S3 is to identify instances of the at least one 
type of message based on the analysis criteria. Fourth step S4 is to analyze the instances 
to determine the desired information. As indicated above, the present invention could 
optionally provide a translation estimate in step S5 before the information is outputted in 
step S6, optionally saved in step S7 and the process ended in step S8. 
[0032] It should be understood that the present invention can be realized in hardware, 
software, or a combination of hardware and software. Any kind of computer system(s) - 
or other apparatus adapted for carrying out the methods described herein - is suited. A 
typical combination of hardware and software could be a general purpose computer 
system with a computer program that, when loaded and executed, carries out the 
respective methods described herein. Alternatively, a specific use computer, containing 
specialized hardware for carrying out one or more of the fimctional tasks of the invention, 
could be utilized. The present invention can also be embedded in a computer program 
product, which comprises all the respective features enabling the implementation of the 
methods described herein, and which - when loaded in a computer system - is able to 
carry out these methods. Computer program, software program, program, or software, in 
the present context mean any expression, in any language, code or notation, of a set of 
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instructions intended to cause a system having an information processing capability to 
perform a particular function either directly or after either or both of the following: (a) 
conversion to another language, code or notation; and/or (b) reproduction in a different 
material form. 

[0033] The foregoing description of the preferred embodiments of this invention has been 
presented for purposes of illustration and description. It is not intended to be exhaustive 
or to limit the invention to the precise form disclosed, and obviously, many modifications 
and variations are possible. Such modifications and variations that may be apparent to a 
person skilled in the art are intended to be included within the scope of this invention as 
defined by the accompanying claims. 
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